M 


?> 

o 

oo 

o 

<1 

o 


THE  CHI- SQUARED  PROCESS 
WITH  APPLICATIONS  TO  HYPOTHESIS  TESTING 
AND  TIME  SERIES  ANALYSIS 


Robert  B.  Davies 


May  1980 


!  STATISTICAL  LABORATORY 

University  of  California 


a. 

o 

C-5 


DISTRIBUTION  STATEMENT  A 
Approved  lot  public  release; 
Distribution  Unlimited  _ 


DTIC 

Selected 

MAY  2  2  1980  9  I 


80  5  21  023 


fflCLASSTO 

SECURITY  CLASSIFICATION  OP  THIS  PAGE  <T7l.n  D.l.  Snl.r.O) 

‘  REPORT  DOCUMENTATION  PAGE 

r  report  NUMBER  2.  GOVT 

OMR,  80-03  ^ 

At.  TLTjLE_X5n^5ji4ita«i  .  -  - . -  -  . 


2.  GOVT  ACCESSION  NO 


.  j  Hie  Chi-Squared  Process  with  Applications  to 
\  Hypothesis  Testing  and  Time  Series  Analysis^ 


17.  author^ 


j  (j  ]  Robert  B.  /Davies  I 


9.  PERFORMING  ORGANIZATION  NAME  AND  AOORESS 

Statistical  Laboratory 
Department  of  Statistics 

University  of  California,  Berkeley,  CA  94720 

M.  CONTROLLING  OFFICE  NAME  ANO  AOORESS 

(ll  N 

Office  of  Naval  Research  KS 

Washington,  D.C.  20Q14 

1 4.  MONITORING  AGENCY  NAME  A  ADDRESS^//  different  from  Controlling  Office) 

r/A  i.  -  ^-4-  '■  W 


116.  OISTRI8UTION  STATEMENT  fa/  Ul /*  Report) 


READ  INSTRUCTIONS 
_ BEFORE  COMPLETING  FORM 

J  RECIPIENT'S  CAT  ALOG  NUMBER 


S.  TYPE  OF  REPORT  *  PERIOO  COVERED 

Scientific 

t.  PERFORMING  ORG.  REPORT  NUMBER 

CNR  80-Q3  ^ 

t.  COM^AACT  OR  GRANT  NUMBER/*./ 

(7^—— - 

2N0fefQ.4  -7  5-  C-0 1 59  _ 

SRAM  ELEMENT.  PROJECT.  TASK 
AREA  A  DORK  UNIT  NUMBERS 


<m£] 


is.  NUMBER  OF  PAGES 

22 

15.  SECURITY  CLASS,  (of  thfe  import) 


IS*.  DECLASSIFICATION/ OOWNGRAOING 
SCHEDULE 


This  document  has  been  approved  for  public  release;  its  distribution  is 
unlimited. 


I  17.  DISTRIBUTION  STATEMENT  ( oi  the  abetted  entered  In  Block  20,  It  different  from  Report) 


j  IS.  SUPPLEMENTARY  notes 


IS.  KEY  WORDS  (Continue  on  reeeree  aide  ft  neceeamry  m%d  identify  by  block  number) 

Discrete  frequency /Maximum/ Nut s ance  parameter/Periodogram/Spectral  analysis/ 
Upcrossing 


I  20.  ABSTRACT  CCeotboum  am  revere*  «Mi  ft  i 


t  identity  by  block  number) 


rS  fD 


~We  calculate  the  expected  number  of  uperossings  of  a  fixed  level  by  the 
process  ^  y  A 


see) J  x*ce>  *  ...  * 

fvr- . 


where  v. .  ,  'Xg  (.*)}'  is  a  multivariate  Gaussian  process  such  that  for 


DO  1473  EDITION  OF  *  NOV  6*  IS  OBSOLETE 


UNCLASSIFIED 

SECURITY  CLASSIFICATION  OF  THIS  PAGE  fRhwl  O.f.  En(w«/J  j. 


'  I 


r 


SECURITY  CLASSIFICATION  or  This  P  AOEflThl  Data  Entarad) 


ABSTRACT  (CCNT.)  'J/fl'P 

--4ach  value  of  9,  the  random  variables  OCjCq) ,  ...  ,  (£S(9)  'are  independent 
with  zero  means  and  unit  variances .  ^ThiS^nables  a  bound  on  the  probabil¬ 
ity  that  the  maximum  of  the  process  [exceeds  a  fixed  level  to  be  obtained. 
The  result  is  used  to  adapt  the  method  of  Davies  (1977)  to  testing  the 
hypothesis  that  a  vector  £  =  0  when  a  nuisance  parameter  9  is  present 
only  under  the  alternative  £  t  0.  The  method  is  then  applied  to  the 
problem  of  detecting  a  discrete  frequency  component  in  a  Gaussian  time 
series . 


Aoaeseian  For 
OTIS  ’ 

noc  m 

Unannounced 

Justification 


Distribution/ _ 

Availability  Codes 

Avail  and/or 
'ist.  special 


DTIC 

ELECTE 


MAY  2  2 1980 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OR  THIS  P  AOtfW*i»"  Data  Entarad) 


THE  CHI -SQUARED  PROCESS  WITH  APPLICATIONS  TO 
HYPOTHESIS  TESTING  AND  TIME  SERIES  ANALYSIS 

Robert  B.  Davies 

Applied  Maths  Division,  DSIR, 

Wellington,  New  Zealand 

Currently  visiting: 

Statistical  Laboratory 
University  of  California,  Berkeley  94720 

ABSTRACT 

We  calculate  the  expected  number  of  upcrossings  of  a  fixed  level  by 
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see)  -  xj(6)  +  ...  *  x^(e) 

where  {X^(*)>  •••  ,  Xs(0)  is  a  multivariate  Gaussian  process  such  that 
for  each  value  of  9,  the  random  variables  X^(9),  ...  ,  Xs(9)  are  indepen¬ 
dent  with  zero  means  and  unit  variances.  This  enables  a  bound  on  the 
probability  that  the  maximum  of  the  process  exceeds  a  fixed  level  to  be 
obtained.  The  result  is  used  to  adapt  the  method  of  Davies  (1977)  to  test¬ 
ing  the  hypothesis  that  a  vector  £  =  0  when  a  nuisance  parameter  9  is 
present  only  under  the  alternative  £  f  0.  The  method  is  then  applied  to 
the  problem  of  detecting  a  discrete  frequency  component  in  a  Gaussian  time 
series . 
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1.  INTRODUCTION 

Davies  (1977)  considered  the  problem  of  testing  the  hypothesis  that  a 
single  parameter  £  »  0  against  the  alternative  £  >  0  when  a  single 
nuisance  parameter  9  was  present  only  under  the  alternative.  In  this 
paper  we  consider  the  problem  of  testing  the  hypothesis  that  all  components 
of  an  s-dimensional  vector  £  are  zero  against  the  alternative  that  at 
least  one  of  them  is  non- zero,  where,  again,  there  is  a  nuisance  parameter, 
9,  which  enters  into  the  model  only  when  £  is  non- zero.  As  in  Davies 
(1977)  standard  asymptotic  methods  cannot  be  applied  directly.  However, 
by  using  them  or  some  other  method,  it  may  be  possible,  for  each  9,  to 
find  a  random  variable  S(8)  which  has,  at  least  asymptotically,  a  chi- 
squared  distribution  when  £  *  0  and  which  would  be  suitable  for  testing 
5*0  against  £  f  0  if  9  were  known.  Then  the  test  we  consider  in 
this  paper  is  to  reject  the  hypothesis  for  large  values  of 

sup  {S(9):L  <  9  <  U>  (1.1) 

where  [L,U]  is  the  range  of  possible  values  for  9.  The  problem  is  to 
find  the  distribution  of  (1.1)  so  that  significance  probabilities  can  be 
calculated. 

To  simplify  the  discussion  we  suppose  that  £  *  0  and  sample  sizes 
are  so  large  that  deviations  from  an  exact  chi-squared  distribution  can  be 
ignored.  Suppose  that  S(9)  can  be  represented 

s (0 )  =  xjce)  +  ...  -  x*(e) 


(1.2) 


where  the  Xj(0)  form  a  multivariate  Gaussian  process  such  that  for  each 
9  the  Xj(9)  are  independent  with  standard  normal  distributions.  Then  we 
will  say  that  S(9)  is  a  chi -squared  process.  This  is  a  generalization  of 
the  process  considered  by  Sharpe  (1978)  since  we  do  not  require  X^(e^) 
and  Xj^)  to  be  independent  if  0^  f  @2  nor  do  we  require  stationarity. 

As  in  Davies  (1977)  we  find  a  bound  on  the  probability  that  sup  {S(e)} 
exceeds  a  given  value  by  finding  the  expected  number  of  upcrossings  of  a 
fixed  level  by  the  process  S(0).  This  is  done  in  section  2;  in  section  3 
the  result  is  applied  to  the  testing  problem. 

An  important  exairple  of  the  test  is  concerned  with  the  detection  of 
frequency  components  in  a  time- series.  Suppose  Zp  ...  ,Z  is  a  sequence 
of  independent  normal  random  variables  with  unit  variances  and 

EZj  *  sin(je)  ♦  c2  cos(je)  (1.3) 

with  0<U<9<L<tt.  That  is,  if  £  f  0,  a  discrete  frequency  component 
is  present  in  the  series.  This  example  is  considered  in  section  4. 

We  use  the  following  notation:  A*  denotes  the  transpose  of  a  vector 
or  matrix.  A;  1  denotes  an  indicator  function. 
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2.  UPCROSSINGS  OF  A  CHI -SQUARED  PROCESS 


We  suppose  that  X(9):9€[L,U]  is  an  s-dimensional  Gaussian  process, 
with  zero  expectation  and  a  continuous  derivative  Y(9)  =  dX(8)/d8.  We 
further  suppose  that  for  each  0  the  components  of  X(8)  are  independent 
with  unit  variance.  Then  X(9)  and  Y(8)  are  jointly  normally  distributed, 

Y(0)  also  having  zero  expectation.  Suppose 


Var 


f  1 

X(0) 

f  I  A(9) 

Y(0) 

|A*(9)  B  (9) 

(2.1) 


Then  A(0)  is  skew- symmetric  since 


A(0)  +  A*(8)  -  lim  E[X(8){X(0  +  A)  -  X(0) }* 

A-K) 

+  {X(8  +  A)  -  X(0) }X*(9  +  A)]  /A 


=*  0. 


Let  S(9)  =  X*(9)X(0)  as  in  (1.2), 

T(9)  =  dS(0)/d8  =  2X*(8)Y(8)  (2.2) 

and  let  f ( •)  denote  the  density  of  S(9),  a  chi-squared  variable  with  s 
degrees  of  freedom. 

Then  for  u  >  0  one  can  check  that  the  conditions  of  Marcus  (1977) , 
section  5,  are  satisfied.  Following  Sharpe  (1978),  the  expected  number  of 
upcrossings  of  the  level  u  is  given  by 


2-2 


<H0)de 


(2.3) 


where  i|»(0)  =  ECTCe)!^^  (  S(6)  =  u)f(u) 


(2.4) 


In  (2.2)  Y (9)  can  be  replaced  by  Y(0)  -  A*(0)X(0)  in  which  case 
(2.1)  must  be  replaced  by 


X(0) 

_  h  o 

Y(  0)  -  A*(  0)  X(  0) 

jo  Bfe)  -  A*(e)A(e) 

Var 

Now  choose  U(0)  orthogonal  so  that 
U(0) C3(e)  -  A*(0)A(0) }U*(0)  =  A(9) 


(2.  S) 


(2.6) 


where  A(0)  is  diagonal.  Premultiplying  X(0),  Y(0)  by  U(0)  will  not 
affect  (1.2)  and  (2.2)  and  so,  in  these  equations,  we  can  assume  that  the 
components  of  X(0),  Y(0)  are  independent,  those  of  X(e)  having  unit 

variance  and  those  of  Y(0)  having  variances  given  by  the  elements  of 
A(0),  the  eigenvalues  of  B ( 0)  -  A*(0)A(e). 

Before  proceeding  with  the  evaluation  of  (2.4)  we  note  how  A( e)  may 
be  found  from  the  covariance  function  of  X(0).  Let 


Rqa  =  E{X(0  +  A) X*(0) } 


(2.7) 


RgA  -  I  -  E({X(0  +  A)  -  X(0)}X*(0)1  =  AA*(0)  +  o(A) 


21  *  R0A  *  R0A  =  E[{X(Q  +  A)  *  x(0)  HX(0  +  A)  -  X(0)}*]  =  A2B(0)  +  o(A2) 


Then 
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Now 


1*94  «P<  -  1sCRBa  -  Re?)  ) 


■ {r  •  Wm  -  "oP  *  «V  *  \l  -  2I» 

•{I  -  -  *4>  *  Xa  -  R9?2)  *  °(“2) 

=  I  -  %A2{B(0)  -  A*(0)A(0)}  +  o(A2) 


But  exp{  -  ^(R  -  Rq*) }  is  orthogonal, 
tion  of  Rqa  can  be  expressed  as 

I  -  *5A2A(6)  +  o(A2) 


Hence  a  singular  value  decomposi- 


(2.8) 


This  provides  the  alternative  way  of  finding  A(e) . 

Returning  to  (2.4),  dropping  references  to  0  and  supposing 

0 
A 


(2.9) 


we  have 

H>  -  E(T1t>0|S  =  u)f(u) 

-  2E{E(X*Ylx*Y>0jX*X,Y) |X*X  »  u}f(u) 


Now 

E(X*Y1X*Y>0!X*X,Y)  *  Cy(X*X)4 

for  some  Cy  which  may  depend  on  Y  but  not  on  X*X  since  X  and  Y 
are  independent  and  the  conditional  distribution  of  X  given  X*X  is  just 
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r 

l 


uniform  on  the  surface  of  the  sphere  of  radius  (X*X)  .  To  evaluate  Cy 
take  expectations: 

e(™x*y>o|y)  = 

On  the  left  hand  side,  rotate  X  so  that  X^  lies  in  the  direction  of  Y 
to  obtain 

E(X11X1>0}  *(Y*Y)"2  =  (2ir)“,*CY*Y)Js 

Combining  these  results  and  substituting 

f(u)  =  us/2-VU/2  /  {r(s/2)2s/2} 

E(X*X)%  =  2j'jr{(s  +  l)/2}  /  r(s/2) 

we  find 

u(s-l)/2  e-u/2 

■K9)  =  E{Y*(9)Y(9)}i5  ,  (2.10) 

ls/l  r{(s  +  l)/2} 

where  Y ( 9)  is  a  vector  of  independent  centered  normal  random  variables 
with  variances  given  by  the  elements  of  A(8) . 

Hence  we  have 

Theorem  2.1  If  the  s- dimensional  Gaussian  process  X(0)  has  continuously 
differentiable  sample  paths  for  L  <  9  <  U  and  E(X(0))  =  0,  Var(X(0)}  =  I 
for  each  9  then  the  expected  number  of  upcrossings  of  the  level  u  in  the 
range  9^[L,U]  is  given  by  (2.3)  where  ti»(0)  is  defined  by  (2.10)  and  Y(0)  j 
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is  composed  of  s  independent  centered  normal  random  variables  with  variances 
A^(0),  ...  Xs(0)  being  the  eigenvalues  of  B(0)  -  A*(e)  A(e) ;  A(e)  and 
B(0)  being  as  in  (2.1). 


Corollary  2.2. 

pr[sup{X(0)  ;L  <  0  <  U>  >  u] 


fU 


ipC8)de  +  pr(x^  >  u) 


(2.11) 


where  Xg  denotes  a  chi- squared  random  variable  with  s  degrees  of  freedom. 

The  corollary  is  proved  in  the  same  way  as  formula  (3.6)  in  Davies 
(1977) .  Sharpe  (1978)  has  shown  that  the  number  of  high  level  upcrossings 
in  the  stationary  independent  case  is  approximately  Poisson  and  we  would 
expect  this  to  be  true  more  generally.  Hence,  as  in  Davies  (1977),  we  would 
expect  the  bound  (2.11)  to  be  reasonably  sharp. 

It  remains  to  consider  the  calculation  of  E{(Y*Y)^}.  Harvey  (1965) 
gives  the  general  formula 


t°° 


E{(Y*Y)'*-}  = 


2tt  ’ 


p(t) 


sin  F(t)  -  cos  F(t) 
t3/2  p(t) 


dt 


(2.12) 


where 


s 

F(t)  =  42  arctan  (x^t) 
1  1 


p(t)  =  n  Cl  +  a?  t2) 

l  x 


Formula  (2.12)  can  also  be  expressed 


E  ( (Y*Y)  =  (2tt)*4 


r  00 

a 

Jo 


n  (l  *  \,tyh}^5/Zdt. 
j=l  J 
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Alternatively,  Harvey  (1965)  gives  a  variety  of  approximate  expressions. 

We  consider  two  special  cases.  Suppose  all  the  A ^  are  equal  with 
common  value  X.  Then 

El^Y)3*}  =  (2X)h  r{(s  +  l)/2}  /  r(s/2) 

leading  to  a  result  in  agreement  with  Sharpe's  (1978)  formula  (3.2).  If 
s=2  then  E((Y*Y)^}  can  be  evaluated  directly  (Harvey,  1965)  as 

E((Y*Y)^}  =  (2X1/m)i5  E(1  -  X^X-J  if  Xx  >  X2 

=  (2\2/u)H  E(1  -  X1/X2)  if  X2  >  Xx  (2.13) 

where,  on  the  right  hand  side,  E  denotes  a  complete  elliptic  integral  of 
the  second  kind;  see  Abramowitz  and  Stegun  (1970),  formula  17.3.3. 


5 


I 

1 
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3.  HYPOTHESIS  "TOTING 

Suppose  that  the  outcome  of  an  experiment  is  represented  by  a  (vector) 
random  variable  Z,  the  distribution  of  which  depends  on  parameters 
• • •  »  5S  and  9  where  0  is  known  to  lie  in  the  closed  interval 
[  L,U1 ,  and  that  we  wish  to  test  the  hypothesis  that  all  the  ^  are  zero 
against  the  alternative  that  at  least  one  of  them  is  non- zero.  Now 
suppose  that  the  distribution  of  Z  does  not  depend  on  9  when 
5  =  (£p  . . .  ,  5  )*  =  0  so  that,  for  example,  standard  asymptotic  methods 
cannot  be  applied  directly.  However,  if  0  was  known,  it  might  be  possible 
to  find  a  test  which  rejected  the  hypothesis  5  =  0  for  large  values  of  the 
statistic 

see)  *  xj(0)  ♦  ...  ♦  Xg(8)  (3.1) 

where  the  (0)  were,  for  each  0,  at  least  approximately  independently 
normally  distributed  with  unit  variances  and  zero  means  when  £  =  0.  For 
example,  if  5(0)  represented  the  maximum  likelihood  estimator  for  5 
when  0  was  given  and  E(9)  its  asymptotic  variance  matrix  one  might  take 

X(0)  =  £‘^(9)5(9). 

Alternatively  one  might  derive  a  similar  formula  from  formula  (55)  of  Buhler 
and  Puri  (1966) . 

The  test  suggested  here  is  to  reject  the  hypothesis  5=0  if 


sup  (S(0) :L  <  9  <  U)  >  u. 


(3.2) 
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Formula  (2.11)  can  then  be  used  to  give  an  upper  bound  on  the  probability 
of  type  I  error  or  on  the  significance  probability. 

One  can  also  find  a  lower  bound  on  the  power  using  the  formula 

pr  [{sup  S(e):L  <  9  <  U}  >  u]  <  pr_  .  (S(0  )  >  u}  (3.3) 

0 

0  0 

which  we  would  be  able  to  evaluate  using  a  non-central  chi-squared  distribu¬ 
tion.  The  analogous  bound  of  Davies  (1977)  was  found  to  provide  an  adequate 
approximation;  however  no  studies  have  been  carried  out  in  the  present 
instance. 

We  now  consider  a  particular  non- asymptotic  situation  in  which  it  is 
possible  to  simplify  the  finding  of  the  eigenvalues  required  by  Theorem  2.1. 

Suppose  we  observe  Z  =  (Zp  ...  ,  Z^)*  where  the  Z^  are  indepen¬ 
dently  normally  distributed  with  unit  variances  and  with 

EZ  =  W*(e)e  (3.4) 

where  W(q)  is  an  s  x  n  dimensional  matrix  of  rank  s.  Then  if  q 
were  known  the  most  stringent  test  for  testing  £  =  0  against  5  +  0 
rejects  the  hypothesis  for  large  values  of 

S(0)  =  Z*Q(0)Z  (3.5) 

where 

Q(q)  =  w*(e){W(e)w*(0)}'1w(9). 


Factorize 
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Q(6)  =  U*(0)U(3)  (3.6) 

where  U(8)  is  an  s  x  n  matrix  with  U(8)U*(0)  *  I.  Then  letting 
X(8)  =  U(8)Z  puts  the  problem  in  the  form  already  discussed.  Then 
Y  =  {dU(9)/de)Z  and  so 


A(0)  =  U(8){dU*(8)/d0} 

B(8)  =  {dU(8)/d0}{dU*(e)/de}. 

It  will  be  convenient  to  write  F  =  G  if  matrices  F  and  G  have 
the  same  non- zero  eigenvalues.  In  particular  if  the  products  FG  and  GF 
are  defined  FG  =  GF.  We  need  to  find  the  eigenvalues  of 


B  -  A*A  *  B  +  A2  +  (A*)2  +  AA* 


=  If* 


du  dU*  n  dU*  „  dU*  dU  m  dU  .  n  dU*  dU 
MM  U  ^  U  TT7T  +  ^3-  u*  |  U 


M  M  M  u  M 


M  M 


=  Q 


Q. 


(3.7) 


If  we  write 

R(0)  =  W*(0){W(0)W*(0)}'1(dW(e)/de) 


one  can  check  that 


Q 


2 

Q  -  R(I  -  Q)R 


-  w^ovw*)'1  H  {i  .  w*(ww*)_1w}  (w*yLw 
=  (m*yh  ^  -  SjjJ  m  (WW*)'1  w  1^}  (WW*)'4 


(3.8) 


(3.9) 
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Thus  the  non- zero  elements  of  A(8)  can  be  found  as  the  non- zero  eigen¬ 
values  of  (3.7),  (3.8)  or  (3.9).  Formula  (3.9)  is  particularly  convenient 
if 

WW*,  (dW/dej (dW*/d6)  and  (dW/de)W*  (3.10) 

are  all  diagonal  because  we  can  set 

A  *  (WW*)'1  -  (WW*)'2  ^  W*)2.  (3.11) 


4.  DETECTION  OF  A  DISCRETE  FREQUENCY  COMPONENT 

We  observe  Z  =  (Z^,  ...  ,  Zn)*  where  the  Zj  are  independently  nor¬ 
mally  distributed  with  unit  variances  and  with 

EZj  =  ^  sin  [{j  -  (n+l)/2)e]  +  cos  Kj  *  (n+l)/2}e].  (4.1) 

That  is  Z^,  ...  ,  Zn  is  a  sequence  of  independent  standard  normal  variables 
on  to  which  has  been  superimposed  a  cyclic  effect  with  period  2  rr/  0.  Formula 
(4.1)  is  just  a  change  of  parameterization  of  (1.3).  Now  suppose  we  wish  to 
test  the  hypothesis,  5=0,  that  is  there  is  no  frequency  component,  against 
the  alternative  that  5^0.  Traditionally,  see  Hannan  (1960),  pages  76-83, 
this  problem  has  been  handled  by  looking  at  only  values  of  0  of  the  form 
2-rrk/n:k  =  1,  ...  ( n/2  J.  For  these  values  of  6  the  corresponding  values  of 
(3.5)  will  be  independent  and  so  significance  levels  can  be  readily  calcula¬ 
ted.  However  a  loss  of  power  occurs  if  the  true  value  of  0  falls  between 
these  values.  We  should  emphasize  that  we  are  concerned  with  discrete  fre¬ 
quency  components.  The  method  considered  here  has  little  relevance  to  the 
problem  of  detecting  peaks  in  the  frequency  spectrum  which  have  a  bandwidth 
greater  than  2ir/n  cycles  per  sampling  interval. 

Now  apply  the  theory  of  the  previous  section.  The  matrices  (3.10)  are 
derived  in  the  appendix  and  all  turn  out  to  be  diagonal  so  (3.11)  applies. 
Applying  (3.5)  and  (4.1)  we  find 
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S(8)  = 


n 

E  Z-  sin [{j  -  (n+l)/2}0] 
1  J 


2 

/  V1 


+ 


n 

E  Z.  cos  [{j  -  (n+l)/2}e] 
1  ■> 


2 

/  v2 


(4.2) 


where 


v1  =  {n  -  sin(ne)  /  sin(e)>  /  2 
v2  =  {n  +  sin(ne)  /  sin(e)}  /  2 

For  the  moment  we  suppose  O<L<0<U<TT  so  that  S(0)  is  defined. 

Then  it  is  shown  in  the  appendix  that  the  eigenvalues  and  A2  can  be 

expressed 

(n2  -  1)/(3G)  -  n2/4  ♦  (1  -  F2/G2)/(4  sin2  9)  (4.3) 


where 


F  =  cos(nQ)  -  o  cos  0 

G  ■  1  -  o  sin(ne)/(n  sin  9) 

a  »  +  1  to  give  x1  ,  -1  to  give 

For  each  9  the  value  of  E  {(Y*Y)  }  =  a (8),  say,  can  be  found  from 
(2.13)  and  so  the  bound  (2.11)  is  equal  to 

a(0)  d0  .  u4  e'u/2  /  7r  +  e'u^2.  (4.4) 

JL 
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In  fact  (4.3)  tends  to  zero  as  9  tends  to  0  or  it  so,  provided  (4.2) 
and  (4.3)  are  defined  by  continuity  at  9=0  or  ir,  we  can  allow  9  to  take 
any  value  in  the  range  (0,irl.  For  n  large  and  9  not  near  0  or  it 
formula  (4.3)  can  be  approximated  by  n  / 12  leading  to  a (9)  in  (4.4)  being 
approximated  by  n(7r/24)  which  in  turn  leads  to  (4.4)  being  approximated  by 

n  u2  e‘u/2  (U-L)  /  [2^)h  +  e‘u/2  (4.5) 

The  approximation  of  a(9)  by  n(7r/24)  turns  out  to  be  adequate  if  n  >  4 
and  2ir/n  <  9  <  ir  -  2iT/n.  This  approximation  is  also  good  for  moderate  and 
large  values  of  n  when  9  covers  the  whole  range  0  <  9  <  ir.  In  Table  I 
the  values  of  / q  a(0)  d9  /  ir  and  its  approximation  n(ir/24)'2  are  listed 
for  various  values  of  n.  It  will  be  noted  that  the  approximation  is  espe¬ 
cially  good  if  the  approximate  value  is  reduced  by  .5. 

To  test  the  sharpness  of  the  bound  (4.4)  on  the  significance  probability 
a  simulation  was  carried  out  with  n  =  16.  One  thousand  simulations  were 
performed.  The  function  S(9)  was  examined  at  255  points  and  the  number 
of  times  the  hypothesis,  £  =  0,  was  rejected  at  the  .2,  .1,  .05,  .02  and  .01 
levels  counted  together  with  the  total  number  of  upcrossings.  The  results 
are  listed  in  Table  II.  In  each  case  the  number  of  upcrossings  was  only 
slightly  above  the  number  of  significant  results  indicating  that  the  nominal 
significance  level  was  close  to  the  actual  significance  level.  Other  simu¬ 
lations  with  n  *  4  and  n  *  64  yielded  similar  results.  The  simulations 
were  carried  out  on  the  CDC  6400  computer  on  the  Berkeley  campus  of  the 
University  of  California  and  used  the  IMSL  subroutines  GGNML  and  FFTRC. 
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Formula  (4.5)  can  be  compared  with  the  formula  one  obtains  when  one 
considers  the  test  based  on  the  maximum  of  *  S(2irk/n)  where  k  is  an 
integer.  Assuming  U  and  L  to  be  of  the  foim  (2k+l)m/n 

P  {  supCS^.’L  <  2Trk/n  <  U)  >  u} 

=  i  -  a  -  e~u/2)n(-*J'L^/(-2lr^ 

~ne'u/2  (U-L)  /  (2rr) .  (4.6) 

In  fact,  there  is  relatively  little  difference  between  the  values  of  u  re¬ 
quired  to  give  the  same  value  to  formulas  (4.5)  and  (4.6)  and  thus  there  is 
only  a  small  loss  of  sensitivity  when  the  test  based  on  the  maximum  of  the 
is  replaced  by  the  test  considered  in  this  paper  and  8  is  of  the  form 
2vk/n.  On  the  other  hand  for  values  of  9  of  the  form  (2k+l)ir/n  our 
test  has  a  substantial  advantage. 

In  practice,  of  course,  one  would  need  to  normalize  the  time-series 
(Zp  ...  Z  )  by  subtracting  off  the  sample  mean  and  dividing  by  the  sample 
standard  deviation.  It  may  also  be  necessary  to  condensate  for  serial  corre¬ 
lation  by  fitting  a  simple  autoregressive  or  moving  average  process.  Sub¬ 
tracting  the  sample  mean  would  have  a  major  effect  only  at  the  very  low  fre¬ 
quency  end  of  the  spectrum  and  hence  for  large  n  formula  (4.5)  would 
still  be  applicable.  On  the  other  hand  if  we  denote  (4.5)  by  a(u)  and 
differentiate  we  find  da(u)/du  =  0(a(u)).  Also  u  =  0(log  n) .  Since  the 
standard  deviation  and  any  autoregressive  or  moving  average  parameters  would 
be  estimated  to  an  accuracy  of  0(n'^}  the  effect  on  the  significance  level 
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would  be  similar  to  that  of  changing  u  by  0(un  )  =  0(n  log  n)  which 
would  be  asymptotically  negligible.  Hence,  for  large  n,  formula  (4.5)  would 
still  be  applicable  when  applied  to  the  suitably  pre-whitened  normalized,  and 
centered  time-series. 

According  to  Feller  (1970),  section  26.7,  theorems  1  and  3,  the  central 
limit  theorem  holds  for  moderate  deviations  of  o(n^)  and  hence  we  would 
expect  (4.5)  to  hold  asymptotically,  even  if  the  Zj  were  not  normally  dis¬ 
tributed.  However  further  study  is  required  on  this  point. 
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TABLE  I 


n 

/g  a(8)  de  /-IT 

n(ir/24)’5 

5 

1.26 

1.81 

10 

3.09 

3.62 

15 

4.91 

5.43 

20 

6.72 

7.23 

25 

8. S3 

9.05 

30 

10.34 

10.85 

40 

13.96 

14.47 

50 

17.58 

18.09 

60 

21.20 

21.71 

80 

28.44 

28.94 

100 

35.68 

36.18 

TABLE  II 

Simulation  of  the  Test  for 

Frequency 

Components 

Nominal  Significance  Level 

.2 

.1 

.05 

.02 

.01 

Critical  level,  u 

8.848 

10.385 

11.901 

13.879 

15.362 

Expected  number  of  upcros sings 

200 

100 

50 

20 

10 

Observed  number  of  upcrossings 

221 

114 

57 

21 

10 

Nunber  of  significance  tests 

198 

107 

56 

20 

9 
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APPENDIX 

Derivation  of  Formula  (4.3) 


Using  the  notation  of  section  3  and  letting  m  =  (n+l)/2 


W  =  sin{(l-m)9}  sin{(2-m)e}  ...  sin{(n-m)e} 


cos{(l-in)8)  cos{(2-m)0}  ...  cos{(n-m)e} 


All  the  off-diagonal  elements  of  the  matrices  (3.10)  are  zero  since  the  k-th 
and  (n+l-k)-th  terras  in  the  sums  that  form  these  elements  cancel.  The  dia¬ 
gonal  elements  of  WW*  may  be  expressed 


n  2  n  2 

Z  s:m  {(k-m)9}  and  z  cos4{(k-m)e} 
1  1 


(A.l) 


which  are  equal  to  %nG  where  G  is  as  in  (4.3)  with  a  -  +1  for  the  first 
term  and  -1  for  the  second  terra.  The  diagonal  elements  of  (dW/de)W*  are 
given  by 


n  n 

2  (k-m)  sin{(k-ra)0}  cos{(k-m)9}  and  -Z  (k-m)  {sin{(k-m)0}  cos{(k-m)0} 

1  1  (A.  2) 

and  may  be  evaluated  by  differentiating  (A.l)  and  dividing  by  2.  Hence  they 

are  equal  to  -n(G  cos  9  +  aF)  /  (4  sin  8)  with  F  and  G  as  in  (4.3)  and 
a  as  before.  The  diagonal  elements  of  (dW/de) (dW*/d8)  are  given  by 


I  (k-ra)2  cos2{(k-m)0)  and  z  (k-m)2  sin2{(k-m)0} 


(A.  3) 


Sunning  these  two  terms  we  obtain 


Z  (k-m)2  -  n(n2-11  /  12 
1 

while  the  difference  may  be  found  by  differentiating  (A.  2) : 

Z  (k-m) 2 [cos 2 {(k-m) 9}  -  sin2{(k-m)e}] 

1 

?  2  2  2  2 
=  h  an{(n  -1)  +  (G  +  G  cos  9  +  2  aF  cos9  -  n  G  sin  9)  /  sin  9} 

for  a  *  +1  or  -1.  Thus  (A.  3)  is  equal  to 

n(n2-l)/6  +  n(G  +  G  cos29  +  2aF  cos9  -  n2G  sin2g)  /  (8  sin2g) 

with  a  *  +1  for  the  first  term  and  -1  for  the  second  tern.  This  completes 
the  derivation  of  the  matrices  (3.10).  Evaluation  of  (3.11)  results  in 
formula  (4.3). 
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